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RNA interference (RNAi) is a mechanism whereby small RNAs (siRNAs) directly control gene ex- 
pression without assistance from proteins. This mechanism consists of interactions between RNAs 
and small RNAs both of which may be single or double stranded. The target of the mechanism is 
m RNA to be degraded or aberrated, while the initiator is double stranded RNA (dsRN A) to be cleaved 
into siRNAs. Observing the digital nature of RNAi, we represent RNAi as a Minsky register machine 
such that (i) The two registers hold single and double stranded RNAs respectively, and (ii) Machine's 
instructions are interpreted by interactions of enzyme (Dicer), siRNA (with RISC complex) and poly- 
merization (RdRp) to the appropriate registers. Interpreting RNAi as a computational structure, we 
can investigate the computational meaning of RNAi, especially its complexity. Initially, the machine 
is configured as a Chemical Ground Form (CGF), which generates incorrect jumps. To remedy this 
problem, the system is remodeled as recursive RNAi, in which siRNA targets not only mRNA but 
also the machine instructional analogues of Dicer and RISC. Finally, probabilistic termination is 
investigated in the recursive RNAi system. 

1 Introduction 

RNA interference (RNAi), also known as RNA silencing, is a mechanism whereby a small interfering 
RNA (siRNA) originating from double stranded RNA (dsRNA) directly controls gene expression of a 
target mRNA (HE). The two key steps of RNAi are: 

(i) dsRNA is cleaved into small siRNA's fragments by an enzyme known as Dicer. 

(ii) A single strand of one small siRNA is recruited by the argonaute protein to form a complex called 
RISC. Using the siRNA as a template, RISC then identifies matching sequences in a target mRNA, and 
induces the mRNA to degrade or become aberrant (see the right semicircle of Figure [TJ). 

Therefore, we can regard the initiator of RNAi as dsRNA (since it supplies the siRNAs) and the target as 
mRNA (to be degraded or aberrated by a siRNA in a Watson-Crick complementary manner). 
A third step of RNAi completes a circular pathway from the target to the initiator |2][8l: 

(iii) An aberrant mRNA resulting from step (ii) becomes a template for dsRNA produced by polymeriza- 
tion of RNA-dependent RNA polymerase (RdRp) (see the left semicircle of Figured]). 

Since each step is digital and circularly linked, RNAi resembles a kind of (digital) computation. This 
observation raises the question of whether RNAi can be viewed as a digital computation. If so, what is a 
computational meaning of RNAi and how computationally complex RNAi is. The purpose of this paper 
is to address these issues. 

Firstly, we observe that RNAi can be modeled as a Minsky register machine. The Minsky register 
machine is a Turing complete model of computation, that (instead of an infinite tape for Turing machine) 
is equipped with two registers (for holding numbers) and a finite number of instructions (increment and 
decrement/jump) acting on the registers [10]. While most biological computational models to date are 
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Figure 1 : RNA interference 



based on the Turing machine model; that is, they regard DNA as analogous to a single tape Q, the 
Minsky machine interpretation proposed here is intrinsic to the RNAi mechanism, whereby RNAs can 
be single or double stranded. We first present a naive machine model of RNAi, designated RMrnai, 
in which the two registers are realized respectively as the initiator (dsRNA) and the target (mRNA) of 
RNAi. Increment/Decrement instructions on the registers represent chemical reactions mediated by en- 
zymes and proteins (e.g., RdRp and transcription/Dicer and RISC). However, the naive model lacks any 
rigorous computational language, hence requires a syntactical analysis. Capturing RNAi as a computa- 
tional structure, such analysis aims to extract the computational meaning, in particular, the complexity, 
of RNAi. 

Motivated by the work of Zavattaro-Cardelli fi31 . we describe our machine RMrnai in the calculus 
of Chemical Ground Form (CGF), which is a minimal fragment of Milner's CCS equipped with interac- 
tion rates for each channel, and hence constitutes a subset of the stochastic 7i-calculus ifTTTl . Introduced 
by Cardelli [3], CGF represents chemical kinetics by giving correspondence to a stochastic semantics 
of continuous time Markov chains. Despite its simplicity, the model sufficiently describes chemical ki- 
netics compositionally. However, the primitive description of CGF lacks any direct representation of 
zero-tests for the registers, creating a tendency for the instructions of encoded RMrnai to allow incor- 
rect jumps. To avoid such erroneous probabilistic jumping, an inhibitor must be incorporated into the 
machine instructions. Biologically, this corresponds to a process known as recursive RNAi (recRNAi), 
an extension of RNAi ||9j[T2l[T4l, whereby siRNA produced and accumulating during RNAi inhibits not 
only mRNA but also RISC and Dicer. The extension to recursive RNAi (recRNAi) is obtained by adding 
a feedback linkage to RNAi. The recRNAi is directly represented by a register machine RM rec RNAi, i R 
which si RNAs interactions are naturally interpreted as instruction inhibitors. We describe the machine 
in terms of CGF with fixed points. Probabilistic termination is then investigated in the recRNAi encoded 
system, and Turing completeness up to any degree of precision is demonstrated. 

2 A Naive Interpretation of RNAi in Minsky Register Machine 

In this section, we show that RNAi is naively interpreted as Minsky register machine ifTOll . 

Definition 2.1 (Register machine RMrnai interpreting RNAi (cf. Figure©) RNAi is inteipreted in the 
Minsky register machine RMrmai as follows: Registers r\ and r2 hold species dsRNA and mRNA re- 
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spectively so that the increment on r\ (res. r<i) produces one dsRNA (res. one mRNA) and the decrement 
on r\ (res. rq) removes one dsRNA (res.one mRNA). In biological terms, the increment on register r\ 
represents polymerization RdRp with an aberrant mRNA template, while an increment on r2 represents 
transcription. A decrement on r\ models the enzyme Dicer which cleaves dsRNA into siRNAs, and a 
decrement on r2 models the complementary degradation of mRNA by RISC. Q 
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Figure 2: Register Machine RMR NAi 



The following table displays the chemical reactions for the increment/decrement on the two registers, 
where mRNA* denotes either or mRNA afc . 





n 


n 


increment 


(polymerization) 


RdRp+mRNA"* - 


—> dsRNA 


(transcription) 


— > mRNA 


decrement 


(cleavage) 


dsRNA + Dicer — 


■+ siRNA'.? 


(degradation) 


mRNA+ RISC — > mRNA* + RISC 



Table 1 : chemical reactions 



3 RNAi as Chemical Reaction and Register Machines 

In this section, we describe the register machine RMrnai in Section |2] in terms of of CGF. Recall that 
CGF is a subset of 7i-calculus and of CCS supplemented with channel transition rates. Using three inter- 
action prefixes 71 := TV r \, and \a/ r \, CGF models collision between molecules as well as molecular 
decay. The parenthesized subscript (r) denotes the reaction rate of the channel. Collision and decay are 
described by 

(decay of molecule) • • • © Tr r yQ © • • • — > Q 

(collision of molecules) • ■ • ®lai r yQ® ■ ■ ■ | • • -®\ar r yR@ ■ ■ ■ — ► Q \ R 
Then a CGF is a pair (E,P) of a set E of reagents and a initial solution P. A reagent X,- = Mi for nam- 
ing a chemical specie and molecules Mi for describing the interaction capabilities of the corresponding 
species. Solution is a multiset of variables, which is released by interactions: 

(Reagents) E : = and X = M, E (Molecule) M : = and K .P © M (Solution) P : = and X | P 
Formally, computation of CGF is defined in terms of Labelled Transition Graph, as defined in 151 . 

Every increment instruction = Inc(ry) is formalized directly for j G {1,2} so that once the chemical 
reactions of the first row of Table Q] are complete, we proceed to the next instruction 
(Increment /; = lnc(r,-)) 

/, = RdRp | t./,-+i j' = l 
/; = mRNA | TJj+i j = 2 

'The machine interpretation assumes that the two species of dsRNA and mRNA are disconnected, so that the decrement and 
increment of either species induces no effect on the other. This assumption is justified because the synthesis of dsRNA is here 
regarded as pnmev-independent only [ 1 2|; in other words, dsRNA is directly duplicated in the absence of primer. In primer- 
dependent dsRNA synthesis, the disconnection of the two species is violated. In such scenario, siRNA triggers polymerization, 
hence enables RdRp to copy a normal mRNA. See also the author's [6| on the difference of the two syntheses. 
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The decrement operations are more subtle. Decrements on on r\ and on r 2 represent the chemical 
reactions of the second row of Table [T] which reactions ensure that Dicer and RISC interact to the entities 
in dsRNA and mRNA respectively, and thereby eliminate them. Although Dicer and RISC both induce 
decremental operations, RISC is recycled during degradation so that it is retained in the right-hand-side 
of (degradation), while the Dicer catalyst is consumed during the reaction (cleavage). 

So that the registers may be decremented, they are interpreted as follows: 

Register n dsRNA \=1a\. (siRNA | ■•• | siRNA) 

Register r 2 mRNA :=?a 2 .(r.0e T.mRNA flfo ) 

They represent that dsRNA and mRNA disappear by formation of siRNA, and by degradation or aberra- 
tion, respectively. 

If the chemical reaction occurs in the presence of dsRNA (res. mRNA), we proceed to the instruction 
Otherwise (i.e. if the reaction does not occur because dsRNA is absent (res. mRNA)), a jump 
is made to the instruction I s . Thus in a primitive description of CGF, every decremental instruction 
/, = DecJump(ry,s) is described by 
(Decrement instruction /, = DecJump(r y -,5)) 

j = \ ^ =[ai.(0\l i+1 )®x.I s with Dicer =!a!.(0 1 /j+i) 
j = 2 Ij =!fl2.(RISC|/ (+ i)0T./ v with RISC =!a 2 . (RISC |/ m ) 
The above recursive definition of RISC for j = 2 corresponds to the recycling of RISC described in the 
degradation. 

The decremental instructions so defined contain an error; that accidental jumps to I s occur even if the 
register is non-empty (i.e. in the presence of channel ?a,-). This error results from the absence of zero-test 
of the registers, a test which cannot be directly formulated in terms of CGF. Such an absence has been 
previously noted by Soloveichik et al. lTTBl , in their studies of stochastic chemical reaction networks. 
Lack of zero-test is a main origin of Turing incompleteness of CGF Ifl5l . and motivated Cardelli and 
Zavattaro to develop their Biochemical Ground Form JH as a minimalistic Turing complete extension of 
CGF. 

4 Recursive RNAi and Probabilistic Termination 

In this section, we model recursive RNAi in order to improve the defect described in Section |3l that the 
CGF machine interpretation RMrnai allows non-feasible jumps. We extend the RNAi mechanism to 
a recursive RNAi (recRNAi), whose register machine RM rec RNAi is described in terms of CGF + fixed 
points. This inteipretation guarantees a probabilistic termination of the machine. Via this extended 
mechanism, siRNAs produced and accumulating during interference targets not only mRNA but also 
Dicer and RISC. A schematic of this situation is presented in Figure [3j in which the usual RNAi are 
displayed to the left, but siRNAs are produced by both Dicer and RISC (which simultaneously degrades 
mRNA). The right hand of Figure [3] includes inhibition arrows from siRNA to Dicer and RISC. The 
mechanism is recursive because the RISC complex containing siRNA is being degraded besides acting 
as a degrading agent. The recursiveness of RNAi prevents the decrement operators of Section [3] from 
taking erroneous jumps, since siRNAs accumulating throughout the RNAi cycle work as inhibitors of the 
decrement operators. 

In recRNAi, the chemical reactions involved in Dicer and in RISC are not only those in the second 
row of Table Q] but also those in Table [2] The first row of Table [2] and (cleavage) represent reciprocal 
interactions on Dicer such that Dicer either makes dsRNA disappear by cleavage or Dicer is degraded by 
siRNA. Similar reciprocal interactions for RISC between the second row of Tableland (degradation). 
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siRNA 

(degradation of Dicer) siRNA + Dicer — > 
(degradation of RISC) siRNA + RISC — > 

Table 2: chemical reactions for recRNAi 
Figure 3: Recursive RNAi 

We next configure recRNAi as a register machine RM rec RNAi i n terms of CGF with fixed points. 
Definition 4.1 (RM rec RNAi in CGF with fixed points) 

Registers and /, = Inc(ry) are identical to those of Section[3] The decrement instruction, with incorpora- 
tion of siRNA, is 

(Decrement instruction /, = DecJump(r ; -,/ 4 )) 

/,• =!o/.(0 1 2i+i)©T.(!jJ/©TJ 4 ) = fix x .[a.(0 | I i+l )®T.(\s.X®T.I s )] 
siRNA =?s. siRNA 

In the above definition of when j = 1 (res. j = 2), the left term \aj.(0 | corresponds to Dicer 
(res. RISC) cleaving dsRNA (res. degrading mRNA), while the right term z.(\s.Ii®I s ) corresponds to 
Dicer (res. RISC) being degraded by siRNA. Hence our definition of /, intrinsically reflects the reciprocal 
interactions of Dicer and RISC, and implies a recursive RNAi process in the presence of siRNA. 
The fixed point definition of /, derives from Zavattaro-Cardelli lfl5l . but here we have highlighted a 
biological analogue of the definition. In the following, we modify slightly the results of lfl5ll to obtain 
the main theorem of this section. 

Given a state (/,, r\ = l\ , r 2 = l 2 ) of register machine and a natural number h, the solution in RM rec RMAi 
is defined by [(/,•, n =l u r 2 = h)\h ■= h \ Y\h dsRNA | ]T/ 2 mRNA | n^siRNA, where I t on the right 
hand is that of Definition 14. II 

Proposition 4.2 (correspondence of computations between machine and RM rec RNAi) Suppose a one 
step computation of register machine is given by (li,r\ = h,r 2 = h) 1 — > (Ij, r i = \^ r 2 = Then we 
have the following for the solutions of the two states of the computation: 

- If Ij = Inc(ry) or = DecJump(ry,^) with lj = 0, then the solution |(/,-,n = h,r 2 = h)\h can 

converge to the solution |(//,n =/ 1 ,F2 =4)]l with the probability 1. 

- If lj > and I{ = DecJump(ry,s), the solution [[(/;, n =h,r 2 = h)\h can reach to a solution 
| (lj ,r\ =l[,r 2 = l' 2 ) | \ for some natural number k>h + 1 with the probability > 1 — jj . 

Proof. 

We illustrate the case of /; = DecJump(r ; -,5) (direct for increment instructions), where sigma in the 
second column denote the probability that RM rec RNAi computations attain the right hand side solutions. 
The schematic in the third column displays the execution paths for the probability. 
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We now state the main theorem of this section. 
Theorem 4.3 (probabilistic termination) The following are equivalent: 

- A Minsky register machine starting from a state (Ij,r\ = l\j2 = h) terminates. 

- A CGF (RM rec RNAi> [(^/> r i = h, r 2 = h)\h) probabilistically terminates with probability greater 

than\-YX=h\- 

Proof. Note first that following the execution of a decrement instruction, the number of siRNA increases 
by at lease one. This is because at least one siRNA is produced by Dicer cleavage or by RISC (as it 
degrades mRNA). By Proposition 14.21 a computation of register machine containing d decrement in- 
structions is faithfully reproduced with probability greater than the following: (1 — — jq^) • • • (1 — 

h+kl l...+k d ) ^ Ukti ( 1 - I ) > 1 - Lktt I ' where k i - 1 is the number of si R N As produced by the corre- 
sponding decrement instruction. □ 
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